Exact Computation of Coalescent Likelihood under the Infinite Sites Model
Abstract
Coalescent likelihood is the probability of observing the given population sequences under the coalescent model. Computing the coalescent likelihood under the infinite sites model is a classic problem in coalescent theory. Existing methods are based on either importance sampling or Markov chain Monte Carlo. In this paper, we develop a simple method that can compute the exact coalescent likelihood for many datasets of moderate size, including a real biological dataset whose likelihood was previously thought to be difficult to compute exactly. Simulations demonstrate that the practical range of exact coalescent likelihood computation is significantly larger than previously believed.
Similar resources
Exact Likelihood Calculation under the Infinite Sites Model
A key parameter in population genetics is the scaled mutation rate θ = 4Nμ, where N is the effective haploid population size and μ is the mutation rate per haplotype per generation. While exact likelihood inference is notoriously difficult in population genetics, we propose a novel approach to compute a first order accurate likelihood of θ that is based on dynamic programming under the infinite...
Exact coalescent likelihoods for unlinked markers in finite-sites mutation models
We derive exact formulae for the allele frequency spectrum under the coalescent with mutation, conditioned on allele counts at some fixed time in the past. We consider unlinked biallelic markers mutating according to a finite sites, or infinite sites, model. This work extends the coalescent theory of unlinked biallelic markers, enabling fast computations of allele frequency spectra in multiple ...
Topologies of the conditional ancestral trees and full-likelihood-based inference in the general coalescent tree framework.
The general coalescent tree framework is a family of models for determining ancestries among random samples of DNA sequences at a nonrecombining locus. The ancestral models included in this framework can be derived under various evolutionary scenarios. Here, a computationally tractable full-likelihood-based inference method for neutral polymorphisms is presented, using the general coalescent tr...
Statistical tests of the coalescent model based on the haplotype frequency distribution and the number of segregating sites.
Several tests of neutral evolution employ the observed number of segregating sites and properties of the haplotype frequency distribution as summary statistics and use simulations to obtain rejection probabilities. Here we develop a "haplotype configuration test" of neutrality (HCT) based on the full haplotype frequency distribution. To enable exact computation of rejection probabilities for sm...
Coalescent: an open-science framework for importance sampling in coalescent theory
Background. In coalescent theory, computer programs often use importance sampling to calculate likelihoods and other statistical quantities. An importance sampling scheme can exploit human intuition to improve statistical efficiency of computations, but unfortunately, in the absence of general computer frameworks on importance sampling, researchers often struggle to translate new sampling schem...
Publication date: 2009